Skip to content

Conversation

@azreenz
Copy link
Collaborator

@azreenz azreenz commented Jan 7, 2026

Prints out azslurm version using azslurm version

@azreenz azreenz force-pushed the azreenzaman/azslurm-version branch from 3ce565c to 5be26bc Compare January 7, 2026 19:55
@azreenz
Copy link
Collaborator Author

azreenz commented Jan 7, 2026

Prints out azslurm version using azslurm version

root@slurm-25-5-scheduler:/opt/azurehpc/slurm/venv/lib64/python3.11/site-packages/slurmcc# azslurm version
azslurm 4.0.4

shows subcommand in --help output

root@slurm-25-5-scheduler:/opt/azurehpc/slurm/venv/lib64/python3.11/site-packages/slurmcc# azslurm -h
usage:
    accounting_info      -
    buckets              - Prints out autoscale bucket information, like limits etc
    config               - Writes the effective autoscale config, after any preprocessing, to stdout
    connect              - Tests connection to CycleCloud
    cost                 - Cost analysis and reporting tool that maps Azure costs to SLURM Job Accounting data. This is an experimental feature.
    debug_cluster_status - Dumps cluster/{cluster_name}/status call to stdout for support and debug purposes.
    default_output_columns - Output what are the default output columns for an optional command.
    initconfig           - Creates an initial autoscale config. Writes to stdout
    keep_alive           - Add, remove or set which nodes should be prevented from being shutdown.
    limits               -
    nodes                - Query nodes
    partitions           - Generates partition configuration
    refresh_autocomplete - Refreshes local autocomplete information for cluster specific resources and nodes.
    remove_nodes         - Removes the node from the scheduler without terminating the actual instance.
    resume               - Equivalent to ResumeProgram, starts and waits for a set of nodes.
    resume_fail          - Equivalent to SuspendFailProgram, shutsdown nodes
    retry_failed_nodes   - Retries all nodes in a failed state.
    return_to_idle       - Nodes that fail to resume in ResumeTimeout seconds will be left in a down~ state - i.e. down and powered_down. It is also possible the nodes will be in a drained~ state, if the node was drained during resume. This command will set those nodes to idle~.  The one exception is for nodes that have KeepAlive set in CycleCloud. Those nodes will be left as down~ and will be logged. When the user unclicks the KeepAlive, the node can be automatically shutdown if --terminate-zombie-nodes is set, or config["return-to-idle"]["terminate-zombie-nodes"] is true.
    scale                - Create or update slurm partition and/or gres information
    shell                - Interactive python shell with relevant objects in local scope. Use --script to run python scripts
    suspend              - Equivalent to SuspendProgram, shutsdown nodes
    topology             - Generates Topology Plugin Configuration
    version              - Print azslurm version
    wait_for_resume      - Wait for a set of nodes to converge.

positional arguments:
  {accounting_info,buckets,config,connect,cost,debug_cluster_status,default_output_columns,initconfig,keep_alive,limits,nodes,partitions,refresh_autocomplete,remove_nodes,resume,resume_fail,retry_failed_nodes,return_to_idle,scale,shell,suspend,topology,version,wait_for_resume}

options:
  -h, --help            show this help message and exit

@azreenz azreenz marked this pull request as ready for review January 7, 2026 19:58
@azreenz azreenz merged commit d120cd5 into master Jan 9, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants